Abstract

This report addresses the nature of the knowledge people possess about word meanings, and how this
knowledge is acquired and used in reading comprehension. A "standard model" of word meanings is
outlined, which equates word meanings with criterial features, or necessary and sufficient conditions for
serious, literal use of a word. This model underlies both current common sense thinking about word
meanings and much of current pedagogical practice in vocabulary instruction. Arguments against various
aspects of the standard model by linguists, philosophers, and psychologists are reviewed. Although many
points are still being debated, it is concluded that the current consensus of scholars can be best
characterized as the rejection of the standard model.
WORD MEANINGS



Words have often been called slippery customers, and many scholars have been distressed by
their tendency to shift their meanings and slide out from under any simple definition. A goal
of some clear thinkers has been to use words in more precise ways. But though this is an
excellent and necessary step for technical jargon, it is a self-defeating program when applied to
ordinary words. It is not only that words are shifters; the objects to which they must be applied
shift with even greater rapidity. (Labov, 1973, p. 341)

This report addresses the nature of the knowledge people possess about word meanings, and how this 
knowledge is acquired and used in reading comprehension. Drawing on philosophy, linguistics, and 
psychology as well as education, this report attempts to describe gaps in knowledge and controversies 
as well as to marshal the reasoning and evidence for what can be accepted as truths. Although we
have incorporated some original thinking of our own, and have been contentious about some issues, our 
fundamental purpose has been to present a state-of-the-art synthesis. 

Some Basic Distinctions 

As Labov said in the passage quoted above, words, and in particular, their meanings, are "slippery
customers." Words about word meanings are no exception; if anything, they have proved more slippery 
than most. In a field as old as the study of word meanings, it should come as no surprise that the words 
used to talk about meanings themselves have many meanings. Thus, no discussion of word meaning can 
proceed fruitfully without a definition of terms. We will use meaning as an everyday, pretheoretical
term, and sense, reference, connotation, and denotation as technical vocabulary. We would not care to 
say that all of these terms are needed to adequately characterize word meanings. They are needed, 
though, to talk about the distinctions maintained by philosophers, linguists, and others who theorize 
about semantics. 

We will define the reference of a word as the thing or things "picked out" by the word on a particular
occasion of use, as, for instance, the word dog in the sentence The black dog looks mean, in a situation
in which there are several dogs, one of which is black. Of course, the words in two or more utterances
have the same reference if they pick out the same thing.

This occasion-specific use of the term reference is now fairly common (cf. Lyons, 1977), but it must be
distinguished from an older and even more common use to mean all of the things a word might stand
for; in this usage dog would be said to refer to all dogs. To maintain a distinction between specific and
general reference, we will use the traditional term denotation to indicate the entire class of entities
associated with a word. (An alternate term for the entire class that we could have chosen is extension.)
Of course, the reference and the denotation of a word can be identical, as in I'm not afraid of dogs.

The construct denotation applies most felicitously to concrete nouns; for example, the denotation of 
apple is the set of all apples. The construct is extended by analogy to other types of words; for example, 
the denotation of red can be defined as the set of all red objects; the denotation of migrate as the set
of all instances of migrating. The denotation of a word is the set of all potential referents for the word,
imaginary as well as real. Thus, we take the position that the denotations of unicorn and griffin are
different, even though both sets happen to be empty in the known universe. 

Notice that a person's internal representation of the denotation of a word could not be just a list of 
members in the set, because the denotations of most words are indenumerable. How, for instance, 
could apples from next year's crop be listed? Instead, what people must have in their heads is some 
basis for determining membership in the set. We will use the traditional term connotation (we could 
have used intension) for the distinctions, or rule, for deciding whether an object, action, or property
belongs to the set that constitutes the denotation of a word. (This use of connotation should not be
confused with the everyday meaning of affective coloration.)
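
The contrast between a stored list of members and a rule for deciding membership can be illustrated with a minimal Python sketch. The `Thing` class, the `known_apples` list, and the `is_apple` rule are hypothetical names introduced only for illustration, and the rule itself is a deliberately trivial stand-in; nothing here is a claim about how the mental lexicon is actually organized.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Thing:
    """A toy object in the world (hypothetical, for illustration only)."""
    kind: str          # what the object actually is
    harvest_year: int  # distinguishes one crop from another


# A finite list can only contain apples that have already been encountered.
known_apples = [Thing("apple", 1987), Thing("apple", 1988)]


def is_apple(thing: Thing) -> bool:
    """Toy stand-in for a connotation: a rule for deciding membership."""
    return thing.kind == "apple"


# An apple from a crop that did not exist when the list was compiled.
next_years_apple = Thing("apple", 1990)

print(next_years_apple in known_apples)  # False: the list cannot anticipate it
print(is_apple(next_years_apple))        # True: the rule applies to new cases
```

The list fails on the new apple while the rule succeeds, which is the sense in which a representation of a denotation has to be generative rather than enumerative.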

We will define the sense of a word as the distinctions the word conveys in a particular circumstance of
use. A more common usage is to equate sense with connotation as we have just defined it, that is, as
the distinctions which it is supposed are conveyed in any and all circumstances in which the word is
used in a serious, literal-minded fashion.

The four terms we have introduced are related according to the paradigm below: 



Language
Function        One Occasion    All Occasions

Pointing        reference       denotation
Attributing     sense           connotation



Sense and reference are context-bound whereas denotation and connotation are context-free. Ordinarily, 
but not always, the reference of a word is included within its denotation. The sense and the connotation 
of a word are related but, if we are right, they are seldom identical.

Although all four of these concepts may not be necessary for a satisfactory characterization of word 
meaning, either the sense/reference distinction or, alternately, the connotation/denotation distinction
is required. The distinction is necessary to account for the fact that two expressions can refer to the
same individual or object, and yet clearly be different in meaning. In a classic paper, Frege (1892,
translation, 1966) made the case for the distinction using the sentence The morning star is the evening
star. Of course, the referent of the two expressions is the same; both refer to the planet Venus. Yet
the meaning--more precisely the sense--is different; one is about a heavenly body visible in the morning,
the other about a heavenly body visible in the evening. Hence, the sentence is (potentially) informative
instead of being tautological and, hence, uninformative. For another example, compare triangle and
closed three-sided figure. The two expressions denote the same geometric figures. But triangle brings
to mind angularity whereas closed three-sided figure highlights, promotes, or calls attention to the feature
of sidedness. Contrasts such as these demonstrate that the distinction between sense and reference is 
a real one, and contribute to the case that both concepts are necessary to adequately represent word 
meanings. 



Meanings of Different Types of Words 

Any reasonable theory of word meaning has to take into account the fact that there are fundamentally
different types of words. An obvious distinction is that between function words and content words.
Whatever sort of meanings one wants to attribute to the words if and the will be different in nature from
the meanings of words like ostrich or panic. Likewise, words like Hello and Gesundheit, to the extent
that they could be said to have meaning, have meaning of a quite different sort.

We adopted as a basic definition of the connotation of a word the distinctions, or rule, for deciding
whether an object, action, or property is a member of the class of objects, actions, or properties that
constitutes the denotation of the word. However, this definition requires some refinement, in order to
deal with problems posed by different types of words. 

Proper names are a case where some refinement is needed. On the one hand, proper names are used
generatively; seen from different distances, from different angles, or on different occasions, even the 
same individual does not look exactly the same. In that sense, one has some sort of strategy or rule for 
recognizing an individual. 

On the other hand, whatever means people may have for recognizing an individual, these do not 
constitute the meaning, or sense, of that individual's name. Traditionally, therefore, proper names have
been said to have reference, but not sense. They refer to specific individuals, but the reference is by
convention, not by rule. 

One way to clarify what constitutes the connotation of a word has been on the basis of necessity. That
is, the connotation of a word is taken as the set of necessary and sufficient conditions for inclusion in
the denotation. This provides a basis for the claim that proper names do not have sense. For example,
your friends may recognize you on the basis of the size of your nose, the color of your hair, and the
presence or absence of a mustache; but changing any or all of these would not change your name. More
generally, no particular fact about you is necessarily true; therefore, no such fact can be considered part
of the meaning--that is, the connotation--of your name.

On the same basis, it can be argued that there are other categories of words which do not have 
connotation, as we have defined it. In particular, Kripke (1972) and others have argued that this is true 
of natural kind terms. For example, the word dog denotes, by convention, a particular species of animal.
There are various ways we can distinguish dogs from other species of animals, but none of these are the
meaning, or connotation, of the word dog. Changes in the breeding of cats and woodchucks might result
in a world in which the only reliable test for doghood was careful chemical testing of chromosomes; but
the meaning of the word dog would not therefore come to be defined in terms of such a test. The word 
dog simply denotes, by convention, a particular kind of being; the nature of that kind of being, and the 
tests used to distinguish that kind from others, are part of our knowledge of the world, but not part of 
the meaning of the word. 

Green (1984) argues that what has been claimed about natural kind terms applies as well to many other
types of words, so that most words should not be considered to have connotation, as we have defined
it. However, Green does not extend this claim to all words in the language; she agrees that there are
some words which refer, not simply by convention, but by describing. 

It is clear that phrases can refer by describing rather than by convention. For example, someone can 
refer to a given individual either by name (Grover Cleveland) or by a description (the only person to
serve two non-consecutive terms as president of the United States). According to Green, some kinds
of words, those which Putnam (1975) calls one-criterion words, also refer by describing, for example,
bachelor, or orphan. In the case of dog or gold, it is reasonable to say that there is a certain kind of
entity, conventionally denoted by a particular name, and that knowledge of particular facts about these
kinds (e.g., that gold has a particular atomic weight, or that dogs are mammals) is knowledge of the
world, not knowledge of the language. In the case of orphan, however, it cannot reasonably be said that
there is a particular kind of individual, and that among the facts we happen to know about members of
this kind is that they don't have living parents. 

In summary, the point is that one cannot expect a simple, unified theory of word meaning to work for
all types of words. As Putnam says, "To look for any one uniform link between word or thought and
object is to look for the occult" (1975, p. 290).

The Standard Model of Word Meaning 

The "standard model" of word meaning, briefly stated, equates word meaning with connotation, and 
defines connotation in terms of the necessary and sufficient conditions for inclusion in the denotation.
A primary purpose in this report is to explore the properties of this model, then present a variety of
arguments exposing its weaknesses and shortcomings. 

It might seem that we are not only setting up a straw man, but beating a dead horse: It would be
difficult to find anyone at present who would espouse the entire "standard model" as we will define it.
However, we consider this mode of presentation worthwhile, for three reasons:

First, dissatisfaction with the standard model is perhaps the most accurate representation of the current
consensus. That is, scholars concerned with word meanings concur in their rejection of aspects of the
standard model, but not on any model to replace it.

Second, the horse is not dead, or, to switch figures, the straw man is still very seductive. To some
extent, the standard model we define comes close to reflecting a "common sense" view of word meaning
that forms the implicit basis for much of the thinking that is done about word meanings, especially in
applied areas such as reading and vocabulary instruction.

Third, although it might be hard to find anyone who would hold to all the attributes of the standard
model, for most of the individual attributes we will be able to find contemporary proponents.

We want to make it clear that we are looking for a psychologically real and pedagogically relevant model
of word meaning; thus, the standards we set for a theory of word meaning are not necessarily those
adopted by all of its proponents. Nevertheless, contemporary proponents of various aspects of the
standard model include psycholinguists as well as philosophers, and, as we have said, whatever its
origins, the standard model of word meaning continues to influence educational practice. 

The core of the standard position is that word meanings can be characterized in terms of criterial
features, that is, necessary and sufficient conditions for inclusion in the denotation of a word. This
means, for example, that the ability to fly could not be part of the meaning of the word bird, because
it is neither necessary nor sufficient; not all birds fly, and not everything that flies is a bird. The notion
of criteriality follows from assumptions of abstractness and parsimony. These two interrelated properties
are fundamental to the standard model of word meaning; we discuss them in turn in the following
sections.
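
The test of criteriality applied to bird and flying can be written out as a short Python sketch. The four creatures and their feature values are hypothetical toy data chosen to mirror the example in the text, and the function names `is_necessary` and `is_sufficient` are assumptions of this illustration rather than part of any established formalism.

```python
# Hypothetical toy data: whether each creature is a bird and whether it flies.
creatures = {
    "robin":   {"bird": True,  "flies": True},
    "penguin": {"bird": True,  "flies": False},
    "bat":     {"bird": False, "flies": True},
    "dog":     {"bird": False, "flies": False},
}


def is_necessary(feature, category, data):
    """A feature is necessary if every member of the category has it."""
    return all(entry[feature] for entry in data.values() if entry[category])


def is_sufficient(feature, category, data):
    """A feature is sufficient if everything that has it is in the category."""
    return all(entry[category] for entry in data.values() if entry[feature])


print(is_necessary("flies", "bird", creatures))   # False: the penguin is a bird that does not fly
print(is_sufficient("flies", "bird", creatures))  # False: the bat flies but is not a bird
```

On the standard model, a feature that fails either test, as flying does here, is excluded from the meaning of the word however typical it may be of the category.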

Abstractness 

Learning word meanings entails more than acquiring names for a large number of individual persons,
objects, actions, and qualities. Even the concrete object names that constitute a large proportion of
children's first words are not names for individuals, but for classes of objects. The overgeneralizations
sometimes observed in children's language, for instance, a child using the word dog for a cow, are
interesting in the way that they may reflect differences between child and adult representations of word
meanings. But they are also interesting in that they highlight a more basic property of word meanings.
Knowing what a word means is somehow to know a generative rule, a rule that allows a person to map
a given sequence of sounds or letters on to a potentially infinite but still restricted class of possible
referents. Overgeneralization errors of the sort children make are not like mistakenly calling your friend
Mike by the name John. Rather, they reflect the child's attempt to induce a rule.

Some level of abstractness, then, is necessary to account for the ability people have to apply words to
novel instances, for example, to recognize an Irish setter as a dog, even though they may never have seen
a dog of that variety before. What is in question is the degree and nature of the abstraction in the
internal representation of word meanings. There are three ways that information about a word meaning
could be stored in the mental lexicon:

1. Knowledge of a word's meaning is stored exclusively in the form of a rule or generalization defining 
the set of entities or events to which this word can be applied. No information about individual
examples is stored permanently in the mental lexicon. 

2. Knowledge of a word's meaning is stored exclusively in terms of a set of examples of the use of that
word, along with the situations in which these examples are embedded. No rule is stored, but
ordinarily one can be quickly derived from the examples when needed to interpret a new use of the
word.

3. Knowledge of a word's meaning is stored both in terms of examples, and in terms of a rule, perhaps 
an incomplete one, that helps determine the set of possible uses of the word. 

These three alternatives lead to somewhat different predictions about how people will process word 
meanings and what sort of relationships among word meanings can be expected in the language. Of
course, Alternative 1 is embodied in the standard model.

Parsimony 

The level of abstraction postulated for word meanings depends in part on the extent to which parsimony 
of representation plays a role in one's theory of word meaning. The standard theory comes down on
the side of maximal parsimony; this is inherent in the equation of word meaning with a set of necessary
and sufficient conditions. 

The commitment to parsimony is a pretheoretical assumption that reflects a belief about what counts 
as an elegant theory. Parsimony has always been a criterion applied to scientific theories. Explicit 
formulation of this criterion goes back at least to William of Occam (1300-1349). It is an axiom of 
scientific inquiry that the broader the sweep of explanations the better, and that redundant, piecemeal 
explanations should be avoided. In other words, according to the criterion of parsimony, principles
should be stated in the most general form allowed by the data. 

However, there is no convincing a priori reason to assume that, in representing word meanings, the 
human mind avoids redundancy and strives for parsimony of representation. The question that we shall 
raise is whether the data on words and their meanings allow as much parsimony as the standard model
supposes. 

Context Variation in Word Meaning 

The criterion of parsimony becomes problematical as soon as one tries to account for different
meanings, or shades of meaning, that words display on different occasions. A number of psycholinguistic
studies appear to show that the meaning of terms can vary according to context. For example, consider
the meaning of the term nurse in the following two sentences:

Nurses have to be licensed. 
Nurses can be beautiful. 

The first sentence emphasizes the fact that a nurse is a health professional whereas the second 
emphasizes the femaleness of nurses. Anderson and Ortony (1975) obtained evidence of the 
psychological reality of this difference in emphasis. They found that doctor was a better cue for recall
of the first sentence while actress was a better cue for the recall of the second.

Considering results such as the foregoing, Anderson, Pichert, Goetz, Schallert, Stevens, and Trollip
(1976) propose that 

a word does not have a meaning, but has, rather a family of potential meanings. When 
comprehended in context, the meanings of the words in an utterance are further articulated in
a process of inferential interpolation based on schemata which embody one's knowledge of the
world. The effect with respect to nouns is usually to limit the scope of reference to a subset of
the cases which would otherwise be denoted. If the context is rich and if the message is
processed deeply, a noun may be identified with a single real or imagined thing. This process
will be called instantiation ... [A] close analysis will show that a word can have a somewhat
different sense in each use. (p. 667)

Johnson-Laird and his colleagues (Tabossi & Johnson-Laird, 1980) have also found evidence for the
psychological reality of context-based variation in meaning. However, Johnson-Laird (1981, 1983, 1987;
see also Perfetti & McCutchen, 1986) has objected to the claim of Anderson and his colleagues that
both the sense and reference of terms depend upon context. Johnson-Laird argues that except in the
case of genuinely polysemous words--for instance, bank, which can mean a financial institution, the side
of a river, or to tilt and turn an airplane--the sense of words remains fixed on every occasion of use and
only the reference changes. 

Johnson-Laird's position is motivated by a desire to maintain parsimony in the representation of word 
meanings. He asserts this explicitly, when he says,

In fact, there has been too much emphasis on polysemy and in consequence a mistaken view 
about the mechanism of instantiation . . . [T]he crucial psychological criterion is whether or not
it is necessary to postulate more than one semantic representation for a word in order to account
for the interpretations of the sentences in which it occurs. Instead of asking how many different
meanings can be squeezed out of the word, psycholinguists need to ask what is the minimum
number of different senses that are necessary to cope with all of its different uses. (p. 196)

Continuing, Johnson-Laird writes, 

If eat were truly polysemous then the sentence He eats the food should be highly ambiguous. It
should have many wholly distinct senses. Yet it remains unequivocal. What is true, however, is
that the sentence in common with others can be truthfully asserted of an infinite number of
different situations: he can refer to any male individual; food can designate an indefinite number
of different ways from chewing it like cud to straining through the teeth. This indeterminacy of
reference is not sufficient to establish ambiguity because, if it were, all open-class words would
be infinitely ambiguous and their meanings could not be contained by a finite brain. Hence, the
sentence above, which truly applies to a variety of situations, is referentially indeterminate, but
not ambiguous. Its syntax is unambiguous and its words are unambiguous; they each have in
ordinary usage a single sense, but these senses suffice, as do the senses of all words, to embrace
many different situations. The sentence requires only a single representation of its meaning.
(pp. 196-197)

Thus, Johnson-Laird's explanation for shifts in meanings of words is that the reference, but not the 
sense, of a word may vary from context to context. With respect to the sentences about nurses shown 
above, presumably Johnson-Laird would say that the set of nurses referred to in the first sentence is not 
identical to the set of nurses referred to in the second sentence. For instance, he might point out with
regard to the second sentence that some nurses are male. Notice, though, that if one were to say about
a male nurse, That nurse certainly is beautiful, it would not be for the purpose of drawing attention to
his status as a health professional! 

But rather than quibble over debatable cases, let us consider contexts in which the reference of certain 
terms is undeniably the same: 

All the nurses at Massachusetts General Hospital are licensed. 
All the nurses at Massachusetts General Hospital are beautiful. 

These uses of nurse differ in sense even though the reference is identical. 

One way to save the concept of fixed word connotation, and at the same time explain the rich
interpretation usually made of words in context, is to distinguish sharply between linguistic knowledge 
and world knowledge. In this realization, connotation is pure linguistic meaning, or core meaning; it is 
the decontextualized and presumably invariant concept associated with a word. And sense (as we are 
using the term) is what might be called the contextual meaning; it is connotation, onto which are 
embroidered inferences based on the circumstances of use and relevant world knowledge. 

Thus, it may be possible to postulate a single connotation or core meaning for nurse, and to account for
the type of variation in the above examples in terms of some general principles for incorporating
inferences based on world knowledge, that would also apply to analogous cases involving words like 
secretary, receptionist, doctor, carpenter, and so on. In some cases, then, a distinction between 
connotation, or core meaning, on the one hand, and sense, or contextual meaning, on the other, may 
make it possible to maintain a single parsimonious representation of word connotation, even in the face 
of what appears to be genuine contextual variation in meaning. 

There are two rejoinders to this (apparently successful) attempt to preserve the parsimony of
representations of word meaning. The first is to point out the steep price of parsimony. The notion of
core linguistic meaning is now at least one more step removed from the data about meaning gathered
in experiments like those of Anderson et al. (1976). As we will attempt to show in detail later,
ultimately the notion of literal core meaning can only be maintained by divorcing this notion from
phenomenological and experiential senses of "meaning."

The second response is that while distinguishing core meaning and contextual meaning may seem to
work in the examples just discussed, it will not work in a large number of other cases. Consider the
problems that arise when one attempts to arrive at any fixed representation of the meaning of the verb
give. The first connotation of this word listed in Webster's Third New International Dictionary (1964) is
"to confer ownership of something without receiving a return." This definition works just fine with John
gave Mary a present, but already there is a problem with John gave Mary $10 and she gave him $2.57
change. The putative fixed connotation fails to cope with the fact that "receiving a return" in goods or
service, as well as a return of change, is expected in this context. The problem is even more acute in
John gave Mary a kiss. Gave seems to be used here in a perfectly ordinary way, but does one really want
to mean that John "conferred ownership" of a kiss?

Examine most entries in a large dictionary such as Webster's Third New International and you will see 
how unwieldy the concept of a fixed, context-free word connotation is. Webster's recognizes that give
is, in Johnson-Laird's phrase, "genuinely polysemous" by according it two main entries. The secondary
entry lists two related senses, (a) "tendency to yield to force" and (b) "state of being springy." The
primary entry begins with "to confer ownership ..." and continues with no less than 56 other related
senses in 14 major groupings, as well as a number of idioms. 

The issue of parsimony must be looked at in terms of degrees and types of relatedness among words.
Everyone would agree that there are instances of homophony--for example, the three different meanings
of bank--that must be described by postulating distinct connotations. Similarly, we would agree with
Johnson-Laird that there are some instances of contextual variation in meaning that might be
characterized in terms of a fixed core meaning, and principles of inference and instantiation that result
in a more elaborated, context-specific sense. However, there remain cases of related meanings, such
as the subentries listed under one of the main entries of give.

The principle of parsimony of representation, applied strictly, makes the following claim: Any two 
senses which are genuinely related (i.e., not instances of homophony) can be treated in terms of a single,
more general, core meaning, and general principles of inference and instantiation that account for
contextual variation in word meanings. We argue that this claim can be falsified in many specific cases.
Not only that, but we argue that a broad look at the English lexicon shows that it is more characterized
by redundancy than by parsimony, and that parsimony is at most a secondary consideration in
determining the nature of semantic representations. Therefore, even when a parsimonious
representation of word meaning in terms of a single core meaning is possible, such a representation 
generally is not the most psychologically realistic. 

The Family Resemblance Model of Word Sense 

It is instructive to consider some of the senses listed under the primary entry for give in Webster's: (3d)
"to administer as a medicine," for example, give her a shot of penicillin; (4c) "to perform the action
necessary or appropriate for a public performance or production," for example, give a concert; (6a) "to
yield or furnish as a product, consequence, or effect," for example, the candle gave its final flicker; (8a)
"to deliver or deal by some bodily action," for example, give him a shove; (9b) "to deliver verbally," for
example, give a warm greeting; (10c) "to make known, or impart knowledge or information about," for
example, give a valid argument; (120 "to allow to have or take," for example, give him permission.

All of these senses of give are related, to be sure. What is not true, though, is that the senses are
related because they embody a single fixed connotation, or core meaning. If the sense of the different
uses of the term were identical, it would be possible to substitute the same synonym in each expression
and preserve the meaning. However, you can say set forth a valid argument, but you cannot, in any
normal situation, say set forth a warm greeting; you can say grant him permission, but you cannot say grant
him a shove.
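
The substitution test can be made explicit with a small Python sketch. The acceptability judgments are the four given in the text; the `judgments` table and the `covers_all` function are hypothetical names introduced for illustration, and the sketch covers only the contexts the text actually mentions.

```python
# Acceptability judgments taken from the text:
# (candidate synonym, object of the verb) -> is the substitution acceptable?
judgments = {
    ("set forth", "a valid argument"): True,
    ("set forth", "a warm greeting"): False,
    ("grant", "him permission"): True,
    ("grant", "him a shove"): False,
}


def covers_all(synonym):
    """True only if the synonym is acceptable in every context tested for it."""
    tested = [ok for (syn, _), ok in judgments.items() if syn == synonym]
    return bool(tested) and all(tested)


synonyms = sorted({syn for syn, _ in judgments})
print([syn for syn in synonyms if covers_all(syn)])  # [] (no candidate works everywhere)
```

If the uses of give shared one fixed sense, some candidate should survive substitution in every context; with these judgments none does.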

Instead, the relationship among the senses of give in various contexts is better characterized as one of
"family resemblance," to borrow Wittgenstein's (1953) insightful metaphor. In a human family there is
a greater or lesser degree of resemblance among the members. The nature of the resemblance shifts
from member to member, without there necessarily being any one clear respect in which all are alike.
The same is true of the meanings of most words in actual use. The features that are important shift
from use to use. A feature which is essential in one use may be unimportant or even absent in another.

We recognize that there are some cases in which a core meaning, plus general principles of inference
and instantiation, can account for contextual variation in word meaning. As Johnson-Laird says, "All
open-class words ... are closer to being pronouns than is commonly recognized; they provide a
relatively simple semantic framework that can be enriched by inferences based on knowledge. These
inferences concern the situation designated by the sentence, and different linguistic contexts highlight
different aspects of lexical meaning" (1987, p. 187).

However, we deny that it is possible to set up a "relatively simple semantic framework" to account for 
much of the variation in meaning represented by related senses found in dictionaries. A problem with 
stretching a single word sense to cover every related use is that the sense necessarily becomes 
increasingly abstract--that is to say, bland and vague. No meaning of give that is general enough to cover
all the clearly related senses would account for the fact that you can give, but not grant, someone a
shove. In general, the effect of stretching word senses to make them maximally inclusive is to cede any
linguistic basis for explaining nuance. All richness and particularity of understanding is left to be
explained in terms of knowledge of the world. 

The proposal that word senses are fashioned to be maximally inclusive and parsimonious also leads to
a psychologically implausible picture of vocabulary acquisition, in that learning new senses of a word 
would have to diminish the amount of information in one's lexical representations. Say, for example, 
a person knew the word deliver only in the sense of delivering mail. If that person then learns the senses 
of deliver for delivering babies, and delivering speeches, his or her knowledge of the word deliver 
becomes general to the point of being vacuous. 

Furthermore, although a general core meaning might, with the help of general principles of inference 
and instantiation, account for the range of meanings that a word can take on, this approach will 
generally not account for which specific meanings a given word actually does and does not take on. 
Consider the dictionary entries for the words gusset and gore (entry #2) from Webster's Third New
International, unabridged: 

gusset: 

1a: a piece of chain mail or plate at the openings of the joints in a suit of armor

1b: a usually triangular or diamond-shaped insert (as of cloth or leather) placed in a seam (as
of a sleeve, pocketbook, glove) to give ease or expandability; also, a similar piece made by adding
stitches at the heel of hose

1c: any V-shaped or triangular insert (as in a sail or skirt) as: (1) an elastic insert in a shoe
upper (as for providing a snug fit) (2) gusset tongue: bellows tongue

1d: a pleat or fold esp. in bookbinding

2: something resembling a gusset, as:

2a: a gore of land

2b: (1) gusset plate: a connecting or reinforcing plate that joins the truss members in a truss
joint or fits at a joint of a frame structure or set of braces, (2) gusset stay: a bracket or angular
piece of iron for strengthening angles of a structure (as an airplane or a bridge)

3: a pretended abatement in heraldry consisting of either side of a pall without the top opening

gore:

1a: a small usually triangular piece of land

[...] larger political divisions

[...] usually sparsely settled or uninhabited part of a county (as in Maine [...])

[...] lengthwise sections of [...]

2b: gusset 1c

3a: one of the [...]

[...] surface of a sphere [...]

[...] of each that would distinguish them from each other [...]

These two words show clearly that the family resemblance pattern of related meanings is not restricted
to high frequency words like give. [...] such as triangle [...]

[...] reasons why gusset took on some meanings and [...] others, but as far as a description of
present-day English is concerned [...] Likewise, it is a historical accident [...] birds have not
come to be called [...] possible extensions seem no [...]

[...] meanings; the difficulty is that there are no [...] related but distinguishable senses [...]
constitutes a normal [...] of parsimony does not characterize the way [...]

Accounting for Contextual Variation Within the Standard Model

[...]

John gave Frank five dollars.
John gave Mary a kiss.
The doctor gave the child an injection.
The orchestra gave a stunning performance.

The initial problem, of course, is to postulate a general meaning for give that would account for these
four sentences. At first glance, this seems next to impossible, since the intersection of the four senses
of give here seems close to empty. However, it is not logically necessary for core meanings to simply
be the intersection of the features of all their contextual realizations. One can conceive of rules of
contextual modification of meaning that change features as well as rules that add them. Such a rule
would, in effect, provide an algorithm for answering the question, "Given the core meaning of give
(which could be fairly specific), what meaning is the word give likely to have in this context?"
Essentially, such rules would be principles of metaphor.

Such rules are not inconceivable. Take for example the meaning of high in high mountains, high prices,
high morale, and high opinion. The intersection of the meanings of high in these various contexts is
presumably something very general, something like "positive polarity," which would not be sufficient to
distinguish high from other positive polarity words like large, great, tall, or good, especially since others
of these presumably would also need very vague core meanings, if core meanings were defined simply
in terms of the features common to all uses. However, one could also account for the meanings of high
in the phrases mentioned by giving it a fairly concrete literal definition (for example, "positive polarity
with respect to the vertical dimension"), and then stipulate a principle of semantic extension that says
something like this: When terms having positive or negative polarity do not literally fit their contexts,
preserve their polarity and adjust the scale on which the polarity is expressed to fit the context.
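
The principle of extension just stated can be sketched as a small procedure. The Python below is only an illustration: the scale labels ("cost," "spirits," "esteem"), the two small tables, and the function name interpret are assumptions introduced for this example, not part of any proposal reviewed here.

```python
# Illustrative sketch of the polarity-preserving extension principle.
# The scale labels and the tiny lexicon are assumptions for this example.

CORE_MEANING = {
    "high": ("+", "vertical"),   # positive polarity on the vertical dimension
    "low": ("-", "vertical"),    # negative polarity on the vertical dimension
}

# Hypothetical scale that each noun makes available in context.
NOUN_SCALE = {
    "mountains": "vertical",
    "prices": "cost",
    "morale": "spirits",
    "opinion": "esteem",
}


def interpret(adjective, noun):
    """Keep the adjective's polarity; move it to the noun's scale if needed."""
    polarity, literal_scale = CORE_MEANING[adjective]
    context_scale = NOUN_SCALE[noun]
    kind = "literal" if context_scale == literal_scale else "extended"
    return polarity, context_scale, kind


print(interpret("high", "mountains"))  # ('+', 'vertical', 'literal')
print(interpret("high", "prices"))     # ('+', 'cost', 'extended')
```

Note that the sketch has to be told which scale each noun supplies; deciding that from context is exactly the part of the principle that is hard to state explicitly.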

In the case of give a kiss, it is a little harder to state the principle explicitly, but the point is the same.
Given some core meaning of give, which could be fairly specific, someone who knew anything about
kissing, but had never heard the phrase give (someone) a kiss, could easily figure out what give means
in this context using principles of metaphorical extension.

The notion of core meaning plus metaphorical extension fits awkwardly with the standard model,
however. Logically speaking, within the standard model it would be expected that a meaning derived
by metaphorical extension would be based on criterial features, since these features are the core meaning
of the word in this model. However, as a matter of fact, metaphorical extensions are typically based on
noncriterial features. Birds are not free by definition; nor is craftiness a criterion for being a fox.

And of course, even where the idea of a core meaning plus metaphorical extension is highly plausible,
this does not guarantee that a person does not also store contextual variants. For example, the use of
the phrase the White House to refer to the executive branch of the U.S. government is part of a
predictable pattern of metonymy (cf. the Kremlin, the Vatican, No. 10 Downing Street). But that does
not mean that the extended meaning of the White House isn't lexicalized, that is, permanently stored in
the person's lexicon.

The problem is not that the relationship between the meanings of words is so opaque; in fact, we have
just considered the possibility that it is predictable, given the contexts. The specific evidence against the
core meaning approach in the case of give, for instance, has to do with distribution: the fact that you can give
someone a shove, but not grant someone a shove; you can give a performance, but not donate one, at
least not in the same sense.

One way to solve the problem is to include in the lexical entry, along with the core meaning, a
specification of the contexts in which the word is used. For example, give (permission, kisses,
performance, medical treatment, . . .); grant (permission, favors, . . .). This move to salvage a
parsimonious "core meaning" model seems to work satisfactorily in a number of cases. And, some will 
believe that it is inherently more satisfying to account for the differences in usage of give, grant, and 
donate as conventionalized restrictions on their distribution, rather than in terms of subtle differences 
in meaning. 

But representing the distribution of these meanings in the lexicon, however it is done, necessarily
compromises the principles of abstractness and parsimony; exemplars of the uses of words are being
incorporated into lexical entries. In terms of the overall parsimony of the model, there is a trade-off
between specifying distribution and postulating subtle distinctions in sense. If you are allowed to specify
the contexts in which the word occurs, you can often get by with a vaguer, more generic account of its
meaning. Conversely, more precise formulation of related meanings of a word will, at least in some
cases, account for the word's distribution in the language. Which way is most parsimonious is not clear
a priori. Nor is it apparent that the two alternatives (having separate senses of give for give a kiss and
give a recital, or having one sense but listing the specific contexts in which this one sense may occur)
are any more than notational variants.

However, some facts about English usage appear to require including information about contexts in the
mental lexicon. For example, the fact that in German you can have high punishment (hohe Strafe), while
in English punishment would be called severe, does not seem to be a fact about the meanings of the
words high and punishment in English and German, but simply an arbitrary fact about stock phrases in the
two languages that must be recorded in the lexicon, if people are to use the words appropriately.
Similarly, sometimes contextual information must be included in lexical entries for syntactic reasons.
For example, it must be represented in the lexicon that donate, unlike the related verb give, cannot take
an indirect object without a preposition; one doesn't normally say He donated the museum a large sum
of money.

Not even the compromised model, in which the contexts in which a word is used are specified along with 
a core meaning, saves the standard theory from the embarrassment of multiple related senses. Often 
two or more senses must be postulated, because a word can have different meanings in identical
contexts. For example, the sentence The doctor delivered the baby is potentially ambiguous, since this 
sentence could be used to refer to a situation in which the doctor brought the baby to a house in his 
station wagon. 

In summary, we argue that it is typical of natural language that there are arbitrary, conventional 
restrictions on how words are combined in sentences; our knowledge of words must therefore include 
exemplars of their usage, in some form or another. The fact that words have multiple, contextually- 
contingent meanings is an anomaly for the standard model. Moves to account for contextual variation 
in meaning within the standard model are at best partially successful. 

On the other hand, family-resemblance relationships among meanings are the natural consequence of
a model of meaning in which the meaning of words is represented, not in terms of a maximally abstract
generalization that covers all members of the set, but in terms of specific uses of the word. That is, for
example, the application of a word like game to a new activity, say knocking bottles off a fence by
throwing rocks at them, may be based, not on a consideration of whether that activity meets some
general criteria for games, but on the basis of its similarity, say, to bowling.

Semantic Decomposition 

In the standard model of word meaning, the connotation of a word consists of the set of necessary and
sufficient conditions required for the serious, literal use of the word. Thus, the standard model of
meaning presupposes some type of semantic decomposition, the analysis of meanings into conditions,
or semantic features. We intend semantic feature to be an unproblematical and theoretically neutral
term, although of course the expression has taken on a patina of meaning because of the way it has been
used by various theorists.

A well-known example of feature analysis is the decomposition of bachelor into HUMAN, ADULT,
MALE, NEVER MARRIED (Katz & Fodor, 1963). The features are capitalized (or marked in some
other way such as <human> <adult>) to indicate these are conceptual distinctions instead of just other
words in English.
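
The decomposition can be made concrete with a small sketch. The Python below is an illustration only, assuming that a word sense is an unordered set of feature labels and that every feature is criterial; the names LEXICON and applies_to are hypothetical and are not drawn from Katz and Fodor.

```python
# Illustrative sketch: a word sense as an unordered set of criterial features.
# The feature labels follow the bachelor example in the text; the lexicon
# structure and the function name are assumptions made for this illustration.

LEXICON = {
    "bachelor": frozenset({"HUMAN", "ADULT", "MALE", "NEVER MARRIED"}),
}


def applies_to(word, referent_features):
    """Return True if the referent has every feature the word requires."""
    return LEXICON[word] <= set(referent_features)


unmarried_man = {"HUMAN", "ADULT", "MALE", "NEVER MARRIED", "TALL"}
divorced_man = {"HUMAN", "ADULT", "MALE"}

print(applies_to("bachelor", unmarried_man))  # True
print(applies_to("bachelor", divorced_man))   # False
```

A representation of this kind gives a yes-or-no answer for every referent, which is the property at issue in the arguments about criterial features.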

Any attempt to analyze word senses into component distinctions, however modest and informal, could
be called semantic feature analysis. Some type of semantic decomposition is almost unavoidable in the
description of word meanings. That is, it would be difficult to deny that the meanings of some words
overlap. One could hardly deny that some concepts are complex, and therefore reducible to simpler
concepts. Nor could one deny that there are groups of words that share common elements of meaning.
For example, one would scarcely want to argue that people have multiple, totally independent theories
of gender, one for the word mother, one for the word uncle, one for the word niece, one for the word
grandfather, and so on.

However, the assertion that some words share semantic content does not amount to a theory of semantic
features. Any theory of semantic features necessarily involves stronger claims. A feature theory strong
enough to be interesting will have at least several of the following properties:

Criterial: Word meanings can be characterized in terms of a set of necessary and sufficient components.

Atomic: Semantic features are not further reducible within the linguistic system.

Sense based: Features can be defined in terms of sensory/motor constructs.

Binary: Features have two, mutually exclusive and opposite values.

Unstructured: Word meanings can be described in terms of unstructured or unordered sets of features,
for example, a bachelor is someone who is human, and adult, and male, and unmarried.

Exhaustive: All word meanings in a language can be analyzed exhaustively into semantic components.

Parsimonious: The set of semantic features necessary to describe the vocabulary of a language is small,
at the very least smaller than the number of the words in the language.

Universal: All word meanings in every language can be mapped onto a single universal alphabet of
semantic features.

Sufficient to account for sentence semantics: Notions of sentence semantics such as synonymy,
analyticity, ambiguity, meaningfulness, and anomaly can be adequately characterized in terms of semantic
features.

Causally involved in comprehension: Word meanings are understood by decomposing them into their
semantic features.

Convergent: The elements of meaning that are primitive in a description of adult linguistic knowledge
will also be developmentally primitive, and also computationally primitive, that is, the units of on-line
comprehension.

Although it is often extremely convenient to talk in terms of components or features when discussing
word meanings, one does not have a theory of semantic features at all unless one can maintain at least
some of these properties. And strong objections have been raised to any theory of semantic features
defined with enough precision to have any teeth; each of the above properties can be shown to be
problematic.

Can Semantic Features Capture Necessary and Sufficient Conditions? 

In the strongest versions of standard semantic theory, the goal is to specify the semantic features that
are individually necessary and jointly sufficient for the sober, good faith use of a term. As we have
already said, this goal has turned out to be impossible in principle for natural kind terms (Putnam,
1975), and perhaps inappropriate for other sorts of words as well (Green, 1984).

There are some words that may be considered to have truly necessary features. It does seem, for
example, that a person's being unmarried is a necessary condition for literally calling this person a
bachelor. However, this fact does not force one to the conclusion that even the meanings of these words
can be described in terms of necessary and sufficient conditions. Continuing the example, the meaning
of bachelor is influenced by a complex social milieu that is changing before our very eyes. The
boundaries of its meaning are fuzzy and constantly shifting. Does a man become a bachelor if he
remains unmarried past his twenty-first birthday? Or not until his thirtieth? Does a young man have
to move out of his parents' home to be called a bachelor? Can a divorced man properly be called a
bachelor? Is a man in a "long-term relationship" with a woman a bachelor? Is a man in a gay
"marriage" a bachelor? Is a middle-aged woman who has never been married a bachelor? Has the
meaning of the term shifted so that one now has to say confirmed bachelor to get across what was once
communicated by bachelor alone?



Are Semantic Features Atomic and Sense-Based? 



A feature is atomic, or primitive, if it is not further reducible. A feature is sense-based if its presence
or absence can be directly determined using the eyes, ears, nose, or finger tips. Looking again at the
representation of the meaning of bachelor as a string of features, MALE is perhaps sense-based, but
it seems to be a bundle of separable characteristics rather than one irreducible feature. Obviously,
NEVER MARRIED fails on both counts. It would be impossible to tell from looking at a man whether
he had ever been married. Marriage is a complex social form that should be reducible to more primitive
elements. NEVER is a peculiar, or at least special, modifier; yet the simpler NOT doesn't quite work,
because a man who is divorced or a widower is not (usually) called a bachelor. Bachelor is not an
exceptional case. Few of the semantic features that appear in published analyses appear to be both
atomic and sense-based.



Are Semantic Features Binary?

Semantic features are usually treated as binary. It is customary to indicate that a word has a feature 
with a plus sign and that it does not have a feature with a minus sign. Words coded plus on a feature 
usually denote the presence of some attribute, those coded minus denote the absence or relative absence 
of the attribute. When pairs of words with contrasting values on a feature are examined, the ones coded 
plus usually are linguistically "unmarked" whereas the ones coded minus may be "marked." A word is
"marked" if it contains a prefix, typically one indicating negation such as un- or in-; for example, friendly
and unfriendly, animate and inanimate, and relevant and irrelevant. When a word coded plus and a word
coded minus on a feature are used in a compound expression, the preferred order is the plus word
before the minus word; thus, we say good or bad instead of bad or good, and husband and wife instead
of wife and husband.

The tradition of representing meanings as strings of binary, or two-valued features coded plus or minus 
lays traps for the unwary. VERTICAL seems to be atomic, sense-based, and universal. What
+VERTICAL means is intuitively clear. Putting it in words, it means, roughly, perpendicular to the
plane of the earth's surface. However, there is an indefinite range of possibilities for -VERTICAL. It
decidedly does not mean horizontal. Nor does -BLACK mean white, since many shades of gray are
possible.

In general, a determinate meaning for a word coded minus on a feature is warranted only when the
feature divides the world into two mutually exclusive and exhaustive categories. Thus, strictly speaking,
-TRUE cannot be equated with false, because statements may be partly true, answers not entirely wrong,
and so on. Yet, stipulating that true and false are to be taken as mutually exclusive and exhaustive is
a simplifying assumption that powerfully aids reasoning. Similarly, it can be argued that equating
-MALE with female is a serviceable approximation to reality, despite the existence of bisexuals,
transvestites, and hermaphrodites.

Some scholars have modified semantic feature theory by treating some semantic features essentially as
continuous variables (e.g., Labov, 1973; Lakoff, 1972). Although allowing features to apply to varying
degrees may appear to be a relatively minor extension of the notion of "semantic feature," this extension
in fact impacts on the notion of criteriality of features, which is at the very core of the standard model
of word meaning.

Can Word Meanings be Described as Unordered Sets of Features?

The simple versions of feature theory assume that word meanings are unordered sets of features. 
However, it is easy to show that some sort of internal structure is needed to represent many word 
meanings. An example is husband. To say that this word is coded +ADULT and +MALE is merely
to express boundary conditions; the distinctive semantic content is that a husband is the spouse of some
woman and, reciprocally, that this woman is his spouse. Just writing down SPOUSE doesn't reveal who
is related to whom, and what characteristics they must have to enter into the relationship.



Various methods have been proposed for representing the internal structure of word senses. One is in
[...] praise. Each
of the capitalized words in the following represents a case or role. Verbs of judging involve a Situation,
which is an action, deed, or state of affairs that may impact favorably or unfavorably upon the Affected.
The Situation may have been caused by a Defendant. A Judge renders a moral judgment about the
Situation or the Defendant's responsibility. To illustrate, Fillmore gave the role structure of accuse as:
A Judge tells an Addressee that a Defendant is responsible for a bad Situation.

According to Fillmore, a role is a variable to be assigned a value, or a slot to be instantiated, based on
information from utterances being interpreted. For example, in the sentence Sara accused Rick of
leaving the gift, the Judge is Sara and the Defendant is Rick. The Addressee is not clear; perhaps this
role is also filled by Rick. Though leaving a gift is not ordinarily bad, it must fill the bad Situation role;
otherwise, the sentence will not make sense. This leads to further inferences: Sara is accusing in a
playful manner, the gift is something outrageous, or Rick left the gift someplace where it should not
have been left (e.g., at home instead of bringing it to a birthday party). Fillmore is making the same
point frequently made by schema theorists: People strive for coherence; they fill slots with the
information given when possible, by inference when necessary.
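
The idea of roles as slots to be instantiated can be sketched in a few lines. The Python below is a rough illustration, not Fillmore's own formalism: the class JudgingFrame, its field names, and the helper instantiate_accuse are assumptions introduced here, and the sketch fills slots only from information supplied directly, leaving inference aside.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class JudgingFrame:
    """Roles named in the text for verbs of judging; None marks an unfilled slot."""
    judge: Optional[str] = None
    defendant: Optional[str] = None
    addressee: Optional[str] = None
    situation: Optional[str] = None
    situation_is_bad: Optional[bool] = None


def instantiate_accuse(judge, defendant, situation, addressee=None):
    # accuse: a Judge tells an Addressee that a Defendant is responsible
    # for a bad Situation, so the Situation slot is forced to count as bad.
    return JudgingFrame(judge=judge, defendant=defendant, addressee=addressee,
                        situation=situation, situation_is_bad=True)


# "Sara accused Rick of leaving the gift."
frame = instantiate_accuse("Sara", "Rick", "leaving the gift")
unfilled = [name for name, value in vars(frame).items() if value is None]
print(unfilled)  # ['addressee']
```

The unfilled Addressee slot and the forced "bad" value on the Situation are the two places where, on Fillmore's account, a reader would have to supply inferences.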

The point here is that there are groups of words (such as buy and sell, or accuse, blame, and
criticize) whose meanings differ not in which features they contain, but in the structure into which these
features are organized.

Can Semantic Features be Both Exhaustive and Parsimonious? 

Can a set of features be constructed which both exhaustively describes the vocabulary of a language and 
which is substantially smaller than that vocabulary? Analysis of the words in a language into semantic
features is a reasonable enterprise only if the number of semantic features turns out to be smaller than
the number of words they describe (Fodor et al., 1980). Knowing whether this is the case depends on
an exhaustive and detailed semantic analysis of at least one language, something that is unlikely to be
available in the foreseeable future. 

There is a reason to doubt that semantic feature analysis will result in a vocabulary of features 
considerably smaller than the vocabulary of words. Having a relatively small number of features
depends on the features required to make distinctions in any one semantic domain applying in several
other domains.

For example, an absolute minimum of eight semantic features would be necessary to describe the
meanings of swagger, strut, stride, saunter, pace, amble, and stroll: one feature, at least, for what they
share, and at least one feature each that distinguishes them from the others. This semantic feature
analysis will result in a set of features smaller than the vocabulary it describes only if these features are
used elsewhere. For example, if there were an adverb for each of these verbs that exactly captures how
the verb differs from the base verb walk (i.e., if the word arrogantly expresses exactly that which
distinguishes swagger from walk, vigorously that which distinguishes stride from walk, etc.), then there
would be 14 words (not counting walk), and only eight features.
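
The arithmetic behind this example can be checked with a short sketch. The Python below simply counts; the variable names are introduced for illustration, and the one-adverb-per-verb pairing is the hypothetical best case described in the text, not a claim about English.

```python
# Counting sketch for the walking-verb example in the text.
manner_verbs = ["swagger", "strut", "stride", "saunter", "pace", "amble", "stroll"]

shared_features = 1                          # what all seven verbs have in common
distinguishing_features = len(manner_verbs)  # at least one per verb
min_features = shared_features + distinguishing_features

# Hypothetical best case: each verb has an adverb that reuses its
# distinguishing feature, so no new features are needed for the adverbs.
words_described = len(manner_verbs) * 2      # seven verbs plus seven adverbs

print(min_features)     # 8
print(words_described)  # 14
```

Without the adverbs, eight features would be needed for seven words, so the economy depends entirely on the features being reused elsewhere in the vocabulary.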

Whether analysis of meanings into semantic features would actually result in such economy depends on
two things. First, there is the question of whether or not one is satisfied with definitions such as "walk
arrogantly" for swagger. Similarly, one can argue that bachelor means more than "unmarried adult male."
One can add features such as "eligible" in the attempt to capture the subtleties of word sense that simple
clusters of features do not capture, but this diminishes the chances that the number of semantic features
postulated will be smaller than the number of words.

Second, there is the question whether the fine-grained distinctions necessary in any given domain will
be applicable in any other domain. For example, the semantic difference between roller skate and
skateboard, whatever it is, is not likely to play a role in very many other semantic domains. One could
even go through the labor of trying to specify the semantic feature, perhaps INDIVIDUATED FOR
SYMMETRICAL BODY PARTS, and trying to find other pairs of words that seem to be distinguished
in terms of the feature, for instance, mittens versus muffs, culottes versus skirts, and, maybe, skis versus
toboggans. Still, this leaves unvitiated the force of the argument that there will be an unmanageably
large number of extremely specialized features if one tries to account for all of the fine distinctions in
the vocabulary of a language.

One way to try to get around this problem is to recognize two types of semantic features. An example
of such a division is Katz and Fodor's (1963) distinction between semantic markers (features which occur
in many word meanings) and distinguishers, the semantic content unique to the word, that cannot be
further analyzed into features that recur in other meanings.

It should be noted first of all that such a move involves abandoning several of the properties often
postulated for a theory of semantic features. Taken together, semantic markers and distinguishers are
no longer necessarily fewer in number than the words in the vocabulary of the language they describe.
But semantic markers alone do not exhaustively describe the meanings of words. On the other hand,
there may be a little more hope for having semantic markers (but not distinguishers) meet some of the
other conditions postulated for semantic features, for instance, being atomic and universal.

Katz and Fodor had set as the basic goals of their theory accounting for the sentence-level semantic 
phenomena of ambiguity, analyticity, meaningfulness, anomaly, and synonymy. They had attempted to 
constrain their semantic theory by claiming that these phenomena could be described in terms of 
semantic markers, with no reference to the information in distinguishers. Bolinger (1965) argued
cogently that this claim could not be maintained, effectively putting an end to the use of this distinction. 

Are Semantic Features Universal? 

Some semantic features, such as CAUSE, recur in the descriptions of numerous languages, reflecting,
presumably, either innate properties of human cognition, or universal properties of human experience.
While it is reasonable to suppose that some features are universal, any strong claim of universality of
features would be incompatible with the criterion of exhaustiveness.

A strong test of universality would be to show that there is no semantic distinction between two words 
in any one language that cannot be expressed in terms of features independently motivated for many 
other languages. To carry out such a test for even a single semantic distinction in a well-researched 
domain, such as, for instance, kinship terms, would be a daunting task.

Are Semantic Features Sufficient to Account for Sentence Semantics? 

In addition to providing a gloss on the meanings of individual words and furnishing the differentia for
sets of related words, features are invoked to explain a variety of other aspects of meaning and language
understanding. According to Katz and his associates (Katz, 1979; Katz & Fodor, 1963; Katz & Postal,
1964), whose seminal writings can be credited with the surge of interest in semantic feature analysis
beginning in the mid 1960s, features play a pivotal role in resolving such matters as whether sentences
are ambiguous, anomalous, or tautological.

The treatment of anomaly is illustrated by the sentence The dream was tall. It doesn't make sense
because dreams are not the sorts of things that can be tall. According to the theory, the problem is
that dream does not have the feature +PHYSICAL OBJECT, which is required for the use of tall.
It is the violation of this so-called "selection restriction" that is said to make the sentence anomalous.
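
The selection-restriction account of anomaly can be sketched as a simple feature check. The Python below is illustrative only: the two small tables, the entry for tree, and the function name is_anomalous are assumptions made for this example rather than part of Katz and Fodor's formalism.

```python
# Sketch of a selection-restriction check. The feature assignments and
# function name are illustrative assumptions, not the theory's notation.

NOUN_FEATURES = {
    "dream": set(),                 # lacks PHYSICAL OBJECT
    "tree": {"PHYSICAL OBJECT"},
}

ADJECTIVE_REQUIRES = {
    "tall": {"PHYSICAL OBJECT"},    # tall selects for physical objects
}


def is_anomalous(noun, adjective):
    """True if the noun lacks a feature the adjective selects for."""
    return not ADJECTIVE_REQUIRES[adjective] <= NOUN_FEATURES[noun]


print(is_anomalous("dream", "tall"))  # True
print(is_anomalous("tree", "tall"))   # False
```

A check of this kind consults only the lexical entries; it has no access to the situation in which a sentence might be uttered.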

The comprehensiveness of the feature theory proposed by Katz and his associates makes it very
appealing and, at one and the same time, opens it to telling criticism. It can be argued that the theory
sweeps under its skirts issues that can't be, or shouldn't be, decided solely on the basis of knowledge of
the language. Feature theory glosses the sentence She is the father of her children as anomalous because
father is marked +MALE whereas she and her are marked -MALE. The sentence expresses a riddle,
to be sure, but the difficulty is not in understanding the assertion. Instead, the obstacle to
comprehension is conceiving a situation in which the sentence could be a good faith utterance. A
mundane possibility is a single woman who has to roughhouse with her sons, teach them to play ball,
and so on. Matched to this scenario, father is being used metaphorically. With just a little more
imagination it is possible to envision a setting for the sentence in which father has a literal meaning:
Suppose a transvestite with a family has a sex change operation (see Lyons, 1977, p. 305).

Are Semantic Features Causally Involved in Comprehension? 

The weakest claim that could be made for some sort of semantic feature theory as a model of meaning 
is that of what Chomsky (1965) would call "observational adequacy." That is, the semantic features are
hypothetical constructs that attempt to describe the content, although not necessarily the form, of 
people's knowledge of word meanings. 

For example, at this level, saying that words such as brother, uncle, and bachelor have the feature 
+MALE is simply an assertion that anyone who knows what these words mean applies them (seriously
and literally, at least) only to male individuals. 

A stronger claim about semantic features can be made, namely, that they describe not only the content 
of people's word knowledge, but also the form it takes. From this perspective, features are 
psychologically real. A person who knows what the word means has this meaning represented in his 
or her mind as a set of semantic primitives. 

This stronger claim, that the componential representation of word meaning is somehow isomorphic to 
the cognitive representation and processes involved in knowing and understanding words, gives rise to 
specific predictions about language processing, for instance, that speed and difficulty of processing
should be predictable from the number and nature of semantic features involved in a word's meaning.
Despite numerous experiments, unequivocal support for any strong version of this claim has not been
found (Fodor et al., 1980; Kintsch, 1974). 

Are Semantic Features Convergent? 

Can there be a convergent theory of semantic features? That is, can one postulate a single set of 
features which are developmentally primitive (features which are acquired early, and out of which later 
meanings are constructed), computationally primitive (the elements involved in on-line comprehension),
and definitionally primitive (the set of features out of which all meanings in adult language can be
constructed)?

Carey (1982) points out that the classical view of semantic features assumes this kind of convergence 
(p. 351). As we have just argued, there is no evidence that semantic features are computationally
primitive at all. Carey argues further that the features which would best describe distinctions among
adult word meanings are not likely to be developmentally primitive (p. 367). Some of the distinctions
among word meanings known by adults can only be described in terms of theories of the world which
the young child does not yet possess. Hence, it does not seem possible for the same set of features to
underlie children's acquisition of word meanings and adults' use of them. 

Semantic Features As Lexical Organizers 

Our discussion of what is known about semantic features left us with something of a paradox. On the
one hand, some notion of semantic decomposition is almost unavoidable in any discussion of word
meanings. As we said, it is hard to believe (by way of example) that a speaker of English would have
separate, independent theories of gender for each term in the language that involved specification of a
person being male or female. In other words, such terms must share common semantic elements. On
the other hand, as we argued, any of the specific claims for a theory of semantic features that would be
strong enough to have some teeth to it is problematical.

One step toward resolving this paradox is to attribute a more limited role to semantic features in the
internal representation of word meanings. Semantic features can be treated as generalizations about
the overlap in meaning among words. In this treatment, knowledge of the meaning of individual words
is not necessarily embodied in features. An implication is that children initially could learn to use words
such as father, mother, sister, and brother correctly without fully recognizing the semantic relationships
among them. Only later would they come to understand the features that structure kinship terms.

Several researchers in the area of child language acquisition have suggested essentially this position.
For example, Nelson (in press), after reviewing a range of language development research, says:

All of these findings suggest a system that is at first characterized by independent lexemes, that
are related to experientially-based concepts but are not related directly to other lexemes,
subsequently becoming reorganized in terms of relations between lexical items, a process that in
turn leads to new insights into both the linguistic and the conceptual systems.

Nelson (in press) goes on to describe language acquisition in terms of three phases. The third period
"is one of revision, re-organization and consolidation of lexical items within domains of related words."

Similarly, Gleitman and Wanner (1982), summarizing research on the acquisition of word meanings,
say that "each early word is an unopened package; only much later does productive lexical analysis
begin to appear" (p. 12). Gleitman and Wanner's point applies most obviously to the internal
morphological structure of words. That is, children may learn the word Thanksgiving, and use it with
some degree of appropriateness, before they appreciate that it has anything to do with giving thanks.
However, the point applies equally well to the internal semantic structure of words. Bowerman (1982)
gives specific cases of children learning to appropriately use semantically complex words like drop and
break, which involve the notion of causation. But only later do they show, through overextensions in
their use of other words, that they have recognized that the feature CAUSE has recurrent organizational
significance in the English lexicon.

One way in which this perspective on semantic features differs from a stronger theory is that it abandons 
the claim of exhaustiveness. That is, it does not assume that words can be exhaustively analyzed into 
semantic features. Rather, the point is that some parts of word meanings may be factored out, so to 
speak. Bowerman (1982) theorizes that distinctions which have recurrent organizational significance may 
function differently from those that do not. "Meaning distinctions that are relevant only to one or a
small handful of language forms may typically be left implicit, even if the child might in some sense
recognize them. In contrast, meaning distinctions that run systematically through a variety of forms
. . . may be pulled out for special attention under certain circumstances."

The point is that there are good reasons to distinguish conceptual distinctions a child is able to make
from the subset of those distinctions that come to play a role in the organization of the child's word
knowledge. Carey (1978) explains this in the following way:

The child already knows the words big and little before he learns any of the specialized spatial
adjectives. Since the core comparative structure (including polarity) is part of his early
representations of big and little, these features are already available as lexical organizers when
the child encounters a word like wide or low. By "available as a lexical organizer," I mean already
part of the lexical entry of some word. Although the features underlying the dimensionality of
spatial extent are part of the child's conceptual system, their linguistic relevance might not yet
be recognized. That is, the child might not yet realize that the spatial concepts mark contrasts
between words. It is not unreasonable to assume that features available as lexical organizers
are mapped onto new words more easily than those that are not yet available. (pp. 281-282)

A key assumption is that recognition of the semantic relationships among words, and hence of those 
parts of word meanings that are shared, is not a prerequisite for appropriate use of words and some
level of understanding of them. As Gleitman and Wanner (1982) put it, "internal analysis of the word 
unit is a late and variable step in cognitive-linguistic development" (p. 13). Explicit representation of 
semantic relationships may be an asymptote that adults approach, rather than a characteristic of 
linguistic representation at all ages. Thus, Carey (1978) hypothesizes:

It is possible that even some adults do not discover all the regularities in the domain, never fully
representing, for example, how fat, wide, and thick differ, although they know very well some
paradigm cases of things that can be each. (p. 28)

In the perspective being outlined here, then, semantic features are in some sense optional. People can,
and often do, use and understand words correctly without recognition of the relationships that hold
among them. If this is so, one can then justifiably ask, what function do semantic features have? An
answer is that a knowledge of features facilitates word learning. In Carey's (1978, 1982) research, for
instance, learning a new color word was easier if the construct of a color word was already part of a
child's semantic system. Similarly, learning the word niece should be easier if one already knows that
gender plays a systematic role in differentiating kinship terms.

If analysis of meanings into semantic features is optional, but aids in word learning, one might expect
the tendency to recognize semantic features to be associated with greater facility in word learning.
Van Daalen-Kapteijns and Elshout-Mohr (1981, 1987) argue that the tendency to analyze word meanings
into components is associated with higher verbal ability. They make this claim on the basis of their
analysis of protocols from a task in which subjects had to infer the meaning of a novel word from a
series of context sentences. The task paralleled the one used by Werner and Kaplan (1952), except that
in this experiment, the novel words represented novel, complex concepts. In trying to integrate
information from succeeding sentences, the higher ability subjects treated their initial hypothesis about
the word's meaning as a decomposable structure, adjusting the meaning to fit other contexts by keeping
some parts constant and varying others. The lower ability subjects, on the other hand, tended to treat
word meanings as unanalyzable wholes; if the meaning they chose for the first sentence didn't fit the
second, they would start from scratch rather than trying to alter parts of their first hypothesis.

In this section, we have outlined an alternative perspective on semantic features. Semantic features
are theorized to represent, not essential conditions for comprehension and successful use of words, but
attributes useful for organizing knowledge of word meanings. The evidence cited in this section is
consistent with the position that we outlined earlier about the role of abstractness and parsimony in
lexical representations: We do not deny that people see similarities among word meanings and
represent generalizations about these similarities in some way. What we deny is that word meanings
are represented exclusively, or even primarily, at the maximal level of abstractness.

Sentence Meaning 

Associated with the standard model of word meaning is the view that sentences have a literal
compositional meaning independent of world knowledge, context, or speaker's intentions. That is to
say, the standard position is that the meaning of a sentence is determined by core word meanings,
syntactic rules, and nothing else. We will attack this view in two ways: First, we will argue that for a
large number of English sentences, the literal sentence meaning is not a compositional function of the 
meanings of the component words. Second, we will argue that the concept of literal meaning itself is 
not tenable. 



Sentence Compositionality 

One of the fundamental insights of generative grammar was the creativity of normal language use. As
has been pointed out repeatedly by Chomsky (1965) and others, knowing English (or any other
language) does not consist in knowing a list of specific sentences. Rather, the rules of syntax can be
applied creatively, so that many of the sentences one speaks or hears, although fully understandable,
are occurring for the first time in the history of the language. This is no doubt true, for example, of
the vast majority of the sentences in this report; to the extent that the reader understands them, it is
not because these exact sentences have been read or heard before.

We do not deny that understanding and producing English involves creative use of the syntactic rules
of the language. However, the fact that language use is truly creative does not mean that it is fully
creative. Although the grammar of a language must specify how words are put together to form
sentences, it does not follow that when speakers or writers form sentences, they start from the smallest
units, building phrases from individual words, and sentences from phrases. On the contrary, the
frequency of idioms, collocations, cliches, and stock phrases in normal language use shows that normal
speech is often more like baking a cake from a mix than like baking a cake from scratch.

Normal language use is full of prefabricated units. Such units, since they function as units, tend to
take on meanings above and beyond what is predictable from the meanings of their component parts.
This is most obvious at the level of words. The fact that an iceman brings the ice, but a snowman is
made out of snow, for example, cannot be determined on the basis of the meanings of ice, snow, and
man alone. It would be safe to say that a large proportion of the complex words in the language, and
the majority of the most frequent ones, have meanings that convey something beyond what is predictable
on the basis of the meanings of the parts alone. However, there is also rampant semantic irregularity
above the level of the word. The idiosyncrasy of the meaning of a phrase is sometimes minor; the fact
that one says ham and eggs instead of eggs and ham may encode only the fact that ham and eggs is a
traditional combination of foods, unlike, for example, eggs and broccoli. But there is a continuum of
semantic irregularity, ranging from phrases like ham and eggs to fully opaque expressions like kick the
bucket, or by and large.

In between are a variety of fixed phrases like flew the coop, change one's mind, toe the line, make ends
meet, pay one's own way, make oneself at home, or to think better of something. In some sense, the
in-between cases are the most problematic for a compositional model of sentence meaning. On the one
hand, the meaning of the phrase is more than, or different from, the sum of its parts; on the other hand,
it cannot be treated simply as a long word, because there is recognizable internal structure, and some
of the items within these phrases do have their usual meanings. Furthermore, the number and
frequency of this type of phrase in normal language use is far greater than most people are aware. Take
your chances and make yourself at home may sound like examples of perfectly regular, literal phrases
until you realize that take your risks and make yourself at house, although presumably similar in meaning,
don't sound like normal English.

For a substantial proportion of the phrases used in English, then, it is simply not possible to treat their
meanings as a componential function of the meanings of their parts. Rather, the meaning of the phrase,
or at least some aspects of its meanings, must be considered to be associated with the phrase as a whole.
This raises the possibility that even for some of the phrases that have fully predictable meanings, the
meaning of the phrase as a whole is also stored in memory, rather than being computed from scratch
every time the phrase is used. Of course, for truly novel phrases, meanings will have to be computed.
But there is little reason to cling to an a priori notion of parsimony that would necessitate, for a phrase
with a perfectly regular meaning, that the listener compute its meaning afresh on the fifth, tenth, or
hundredth time he or she heard it.
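
The possibility that phrase meanings are stored whole as well as computed can be sketched as a lookup that consults a phrase store before falling back on composition. The sketch below is a toy illustration under stated assumptions: the stored phrases, the word-level glosses, and the function names compose and phrase_meaning are all hypothetical, and joining glosses is only a stand-in for real semantic composition.

```python
# Hypothetical store of prefabricated phrases and their whole-phrase meanings.
STORED_PHRASES = {
    ("kick", "the", "bucket"): "die",
    ("by", "and", "large"): "in general",
}

# Hypothetical word-level glosses used for the compositional fallback.
WORD_GLOSSES = {"kick": "strike-with-foot", "the": "DEF", "ball": "ball"}


def compose(words):
    """Crude stand-in for composition: join word-level glosses in order."""
    return " ".join(WORD_GLOSSES.get(w, w) for w in words)


def phrase_meaning(words):
    """Return the stored meaning if the phrase is prefabricated, else compose it."""
    key = tuple(words)
    if key in STORED_PHRASES:
        return STORED_PHRASES[key], "stored"
    return compose(words), "computed"


print(phrase_meaning(["kick", "the", "bucket"]))  # ('die', 'stored')
print(phrase_meaning(["kick", "the", "ball"]))    # ('strike-with-foot DEF ball', 'computed')
```

Nothing in such a design prevents a fully regular phrase from also being entered in the store once it has been heard often enough, which is the point made above about parsimony.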

Indirect speech acts also pose a problem for the concept of literal sentence meaning. The sentence
Can you pass the salt? literally asks whether someone is able to pass the salt, but what it actually means
is please pass the salt. A somewhat similar problem is posed by idioms. The sentence He kicked the
bucket literally declares that someone struck a blow with his foot against a type of container, but what
it ordinarily means is that someone died.

The way proponents of compositional theory propose to deal with indirect speech acts and idioms is
to distinguish between sentence meaning and utterance meaning. Utterance meaning is so called because
the sentence is supposed to have been uttered by some speaker in some situation for some reason.
Thus, Do you have change for a dollar? is a question at the level of sentence meaning but usually is a
request at the level of utterance meaning. Sentence meaning is supposed to be the unadorned linguistic
interpretation, while utterance meaning includes elaborations based on linguistic context, situational
context, and background knowledge. The distinction between literal sentence meaning and utterance
meaning parallels the distinction between core word meaning (connotation) and contextual word meaning
(sense) we discussed in an earlier section.

In compositional theories of language processing, it is supposed that people first compute sentence
meaning and then ascertain utterance meaning. It follows that it should take longer to understand
sentences containing expressions like kick the bucket when the expressions are used idiomatically than
when they are used literally. The reasoning is that it will take time to activate and then reject the literal

the sentence Sally cut the cake would not have in mind an abstract representation of the situation which
was equally applicable to cutting the cake with a lawn mower and with a knife. 

It is true that, with some conscious effort, a person can imagine a representation of the "meaning" of
the sentence Sally cut the cake that is so abstract as to apply equally well to either of these situations.
It seems peculiar, though, to cling to the position that the result of such mental gymnastics has more 
right to be called the meaning of the sentence than what comes spontaneously to the mind of the 
normal reader taking the sentence at face value. 

It might appear that the problem posed for the concept of literal meaning by the preceding example 
could be dealt with simply by postulating multiple senses of the word cut. If there are three senses of 
cut, something like "to mow, as with a lawn mower," "to trim, as with scissors," and "to separate into
serving portions," then one might be able to equate literal meaning with the normal understanding of
these sentences. However, such a move would ultimately require an appeal to background knowledge
in the disambiguation of cut, as we explained at length in our discussion of contextual variation in the
standard model. 



Another type of example involves sentences which are not fully comprehensible unless one is able to
bring to bear extra-linguistic knowledge. In this example, unlike the preceding one, distinctions in the
sense of individual words do not play a crucial role. Try to determine the literal meaning of the
following sentences:

The haystack was important because the cloth ripped.
The notes were sour because the seams split.
The party stalled because the wire straightened.

These sentences are well known to psycholinguists from the work of Bransford and Franks (1976), but 
if you have not seen them before you probably experienced difficulty in coming up with anything you
would want to call the sentence meaning. To be sure, these are especially contrived sentences. But they 
are not defective like Colorless green ideas sleep furiously. The syntax is straightforward. The words are 
used in ordinary ways. And, each of the sentences does have a legitimate and readily understandable 
interpretation. 

Now, consider some clues that provide additional information for interpreting the sentences. The clues
are parachute, bagpipe, and corkscrew for the three sentences, respectively. If any of the clues enabled
the "click of comprehension" for you, the source of difficulty with the sentences ought to be clear: The
obstacle to comprehension is envisioning possible situations onto which the sentences could map. The
question is whether, if you failed to come up with the defective parachute envisionment, for instance,
you would want to say that you were in possession of the meaning of the haystack sentence. We believe
the answer is no, but not everyone agrees.

Let us sharpen the issue. The question is whether sentences have a "zero-context," literal meaning,
that is, a meaning that does not depend in any way on analysis of the linguistic context, in any way on
analysis of the situation, or in any way on world knowledge. The zero- or null-context meaning of a
sentence is supposed to depend solely on its wording and syntax (and maybe other linguistic machinery,
as long as no contextual analysis or world knowledge creeps in).

Our experience is that educators do not appreciate the force of weird counterexamples against what
are presented as universal claims, because they so often have to deal with claims that no one in their
right mind would mistake as being universal in scope. The sentences above about haystack and so on
might be discounted because they are a little weird. We turn, therefore, to the exquisitely simple
sentence below:



The cat is on the mat.



This sentence has been debated at length by Searle (1979), who upholds the constructivist position,
and Katz (1981), who defends what we are calling the standard position. Katz claims to know the zero-
context meaning of The cat is on the mat. He says:

The sentence . . . has a literal compositional meaning. Unlike Itches drink theorems, its selection
restrictions are in order. Its meaning is, roughly, that some (contextually specified) cat is
vertically positioned over some (contextually specified) mat and that the aforementioned cat is
also positioned so that its bottom is in contact with the top of the mat. (p. 220)

Katz completely gives away his argument in this passage. Bear in mind that he is not merely suggesting 
the usual meaning of the sentence, or pointing to one among several possible meanings. He is declaring 
the literal meaning based on the sentence's wording alone, with what he alleges is absolutely no reliance 
on context or world knowledge. Thus, to display a legitimate interpretation different from Katz's is to
prove him wrong. And this is easily done. Katz says the sentence means the cat is vertically positioned 
over the mat, but consider the possibility that the mat is being used as a wall hanging and that the cat 
has jumped up and grabbed the mat with its claws. Katz says that the sentence means that the cat is 
also positioned so that it is in contact with the top of the mat, but imagine the possibility of a cat 
balancing on the edge of a rolled, stiff, upright mat standing in a corner. As for the cat, its bottom does
not need to be in contact with the mat in order to satisfy the sentence; it could be standing or it could 
be lying on its side. 

It is plain to see that what Katz holds up as the zero-context, literal meaning of this sentence depends
heavily on assumptions about customary positions of mats and usual postures of cats. We assert as a 
general rule that the literal meaning of a sentence is parasitical on assumptions about normal states 
of the world. It follows that there is no such thing as a zero-context meaning for a sentence. 

The idea of literal meaning as the first derivative in language understanding has had an almost
irresistible appeal for information processing psychologists committed to the concept of automaticity.
Basically they argue that a fast, fluent process could not involve much reasoning or complicated choices 
among alternatives. For a good exposition of this position, which goes over some of the same ground 
as this report, see Perfetti and McCutchen (1986). 

For a different reason, the idea of literal meaning also has appeal to linguists. Consider the sentence
Flying planes can be dangerous. It has two distinct meanings, at least in some sense of meaning,
depending on whether one sees planes as the underlying subject or underlying object of flying. The
concept of utterance meaning does not provide the right level of analysis for dealing insightfully with
ambiguities of this type, however. But if linguists were to adopt a construct of literal sentence meaning
for talking about such examples, they would have to (we argue) avoid identifying that construct with any
actual representation of sentences by real listeners or readers.

We believe that both the assumption of fixed core meanings, which are automatically accessed, and the 
assumption of invariant parsing rules, which are automatically applied, are faulty, for the reasons set 
forth in the preceding sections. This is not to deny that word senses get accessed and sentences
get parsed, or that these steps happen fairly early and fairly routinely during sentence
interpretation. The mistake is to identify these transient linguistic throughputs with meaning.

Why Is The Standard Model So Attractive? 

If the standard model of word meaning is wrong on so many counts, and if there is such a substantial 
consensus among scholars that it is inadequate, why is it so attractive? 

One reason is that the standard model serves as an ideal for scientific terminology, or in fact, 
terminology in any domain in which communication is intended to be as explicit and precise as possible.
In some sense, then, the standard model reflects what many scientists and philosophers wished the 
language were, and tried to make it. Watson and Olson (1987) argue that a standard model of meaning, 
in which words are defined in terms of criterial features, follows naturally from the need for 
communication in a modern, literate society to be successful without personal contact, when there is no 
guarantee of shared assumptions. 

The standard model reflects the time when scholars, at least, still thought of language primarily as
written language. It might have been acknowledged that word meanings in oral language were vague
and variable, but this was looked upon as exactly what good writers would want to avoid. There is, of
course, a long tradition of a prescriptive approach towards language, treating the language not in terms
of what it is, but in terms of a conception of what it should be. It is really only relatively recently that
linguists have explicitly focused on oral, rather than written, language as primary. A prescriptive
approach to language is still firmly entrenched in popular thinking, as can be seen from the nature of
the debates about dictionaries, such as the one surrounding the publication of Webster's Third.

The Relationship Between Linguistic And Conceptual Categories 

To some extent, the attractiveness of the standard model, and its failure, can be understood in terms 
of two different ways of thinking about the relationship between language and the world, or more 
precisely, between linguistic and conceptual categories. 

One we could call the "grid" model. In this model, the relationship between word meanings and the
world can be thought of as analogous to the imposition of national and state boundaries on a map of,
say, North America. In this analogy, state boundaries represent word meanings, national boundaries
represent superordinate categories, and the whole map represents the conceptual domain, say, the
universe of all animals, or of all colors, or of all emotions.

Criteriality is a logical consequence of the grid model. It follows from the fact that the conceptual
domain is exhaustively categorized. The label given to an entity identifies what category the entity is
in, but not its position within that category.

An alternative way of thinking might be called the "scattered points" model. Word meanings are
associated with points in a conceptual space in this model. To use another geographical metaphor,
imagine a world in which almost all of the population lives in widely dispersed cities. Although the
cities are clearly named, the lack of substantial population outside the cities has made it quite
unnecessary, in most cases, to draw boundaries between them. There is a lot of territory that doesn't
belong to any particular city. When an area near an established city is settled, the region is sometimes
simply counted as part of the city, even though it may not be physically contiguous.

There is no reason to identify criterial features in the second model. There are a hundred ways one 
can differentiate Boston from New York; but there is no need to determine which of them uniquely 
identifies the essence of either city.
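
The contrast between the two models can be made concrete with a small sketch. Everything in it is an assumption made for illustration: the one-dimensional 0-100 conceptual scale, the boundary values, the labels, the city positions, and the distance threshold are hypothetical and are not drawn from any study cited here.

```python
import bisect

# Grid model: boundaries exhaustively partition a hypothetical 0-100 scale.
GRID_BOUNDARIES = [25, 50, 75]
GRID_LABELS = ["A", "B", "C", "D"]


def grid_label(x):
    """Every point falls in exactly one category; its position inside is not recorded."""
    return GRID_LABELS[bisect.bisect_right(GRID_BOUNDARIES, x)]


# Scattered points model: named "cities" at hypothetical positions, no boundaries.
CITIES = {"Boston": 10.0, "New York": 30.0}
MAX_DISTANCE = 5.0  # how far from a city a point may lie and still take its name


def nearest_city(x):
    """Return the nearest city if it is close enough, otherwise None (unnamed territory)."""
    name = min(CITIES, key=lambda c: abs(CITIES[c] - x))
    return name if abs(CITIES[name] - x) <= MAX_DISTANCE else None


print(grid_label(10), grid_label(60))      # A C
print(nearest_city(12), nearest_city(20))  # Boston None
```

In the grid sketch a label always exists and is fixed by the boundaries, which is what makes criterial features natural; in the scattered points sketch a label depends only on nearness to a named point, and some territory simply has no name.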

There are some cases of naming in which the grid model is better. For example, a biologist trying to
come up with terms, say, at the mid levels of a biological taxonomy is essentially trying to impose a
grid on a conceptual domain. The terms must be chosen so that they allow one to exhaustively
categorize known animals. However, the taxonomy must not be so bound to idiosyncratic properties
of individual species that the discovery of a new related species would necessitate major reorganization.

More generally, it could be said that the grid model is the standard for scientific terminology, especially
at the level of superordinate terms. The scattered points model, on the other hand, deals more
comfortably with basic level terms used in ordinary language: There is maximal similarity among
members of a basic level category and maximal dissimilarity between members of this category and other
categories (Rosch & Mervis, 1975).

The nature of the relationship between linguistic and cognitive categories has long been a matter of
debate. Whorf (1956) is associated with a strong claim about this relationship, in which linguistic
categories are seen as primary. According to what has come to be called the Sapir-Whorf hypothesis,
it is claimed that languages can differ radically in the way that they categorize and structure the world,
and that linguistic categories determine cognitive categories. Thus, the structure of one's language
determines how one perceives and thinks about the world. In slightly different terms, the linguistic
system is both an embodiment of the world-view of a culture, and a necessary and sufficient means of
inculcating it in new members.

Part of the attractiveness of the Sapir-Whorf hypothesis was that it constituted an instantiation of the
insight that existing knowledge structures have a profound effect on cognition and perception, one that
predated the most recent incarnation of schema theory. The distinctive element of the Sapir-Whorf
hypothesis, however (the primacy of linguistic structure), has never met with wide acceptance.

But despite the fact that the Sapir-Whorf hypothesis has never achieved the status of a consensus
opinion, there is reason to believe that some of the assumptions underlying this hypothesis have
influenced thinking about word meanings, and in fact, constitute part of the foundation for the standard
model of word meaning; in particular, the assumption that experience of the world is inherently
unstructured or continuous, and that language plays a primary role in structuring and categorizing it.
This idea motivated James' famous description of the infant's experience of the world as a "blooming,
buzzing confusion." Thus, until Berlin and Kay's (1969) work, the rainbow was often taken as a
metaphor of the relationship between language and the world. The physical reality is an undifferentiated
continuum; it is language which imposes boundaries.

The grid metaphor, then, can be seen as flowing out of a way of looking at the relationship between 
language and the world. It is most appropriate in those instances in which word meanings do serve to
exhaustively categorize some semantic domain, which is sometimes the case, especially for superordinate
categories. 

The grid metaphor, however, depends crucially on the assumption that experience is essentially
undifferentiated, and that it is language which imposes structure. Recently, increasing evidence has
been brought forward that in numerous domains, conceptual structure is independent of or prior to
linguistic categories (see Au, 1988, for a review of such evidence). For example, in the domain of color,
although physically a rainbow is a continuum of wavelengths, the human perceptual apparatus in fact
defines certain focal colors as being perceptually salient. Linguistic systems differ in interesting ways
in how they categorize colors, but within constraints reflecting the structure imposed by the human
perceptual system (Berlin & Kay, 1969). Perceptually, the rainbow is not a continuum; and there is no
reason to believe that people from different language groups see different rainbows.

Nelson (in press), Clark (1983), and others have suggested that the extremely rapid rate of early word 
growth is due to the fact that once children have figured out what naming is about, they already have 
a very rich conceptual structure, and lots of concepts to learn the names for. It may be relevant in this 
regard that Nagy, Anderson, and Herman (1987) found that, in learning new words incidentally while 
reading, school children learned words for which they already had concepts, but did not learn words 
which represented new concepts. 

We are not arguing that the grid model never fits the relationship between linguistic and conceptual 
categories. There are domains, as we have indicated, such as biological taxonomies, where exhaustive 
categorization is an important function of language. However, we would argue that meanings are not 
typically structured into exhaustive and mutually exclusive categories arranged into taxonomic hierarchies. 

Let's take as an example the average person's knowledge of names for tools. Do the tool names 
known by most people exhaustively categorize the universe of tools, or any subparts of that universe? 
Hardly. We can imagine, and we think we have seen, numerous tools which do not fit into any of our 
existing tool categories. Even for people with rich tool vocabularies, it seems likely that a new tool could 
be invented which would not be a special subcategory of existing tools. 

For most people, if our experience is representative, the vocabulary of tool names contains a 
preponderance of basic level categories (e.g., saw, hammer, drill), with a few subcategories (hacksaw, 
coping saw), and few if any true superordinates. These terms in no way exhaustively categorize the 
universe of possible tools. 

Does this example simply reflect a fact about tools, namely that linguistic categories can't anticipate 
all possible technological breakthroughs? Let's take a different example, then, the set of verbs that 
describe human motion unaided by mechanical devices: verbs like walk, run, skip, hop, dance, crawl, 
etc. Do such verbs divide possible human motion into a taxonomic hierarchy of mutually exclusive and 
exhaustive categories? 
Not without violence to the normal meanings of the words. Again, there appears to be a lack of true 
superordinates; for example, is there a label for the category that includes walk and run? The common 
category names do not exhaustively categorize the space of possible human movement. For example, 
a cartwheel does not seem to fit into any more general category, nor does walking on one's hands. 

The world is structured because there is a lot of empty space in it. There are few if any tools halfway 
between a saw and a screwdriver. A farm does not include a continuum of animals ranging from sheep 
to cows. People talk, and they sing, but in normal life in our culture they usually do one or the other; 
daily experience does not include a lot of things intermediate between the two. 

To conclude, the scattered points metaphor provides a better picture than the grid metaphor of the 
relationships between words and thoughts and objects. Names for things in natural languages have 
redundant, noncriterial features because there is empty space between categories. 

Definitions and Word Meaning 

Almost all conventional vocabulary instruction involves definitions in some way. Students look up 
definitions, memorize them, select the appropriate one from several alternatives, or arrive at them 
through discussion. Indeed, it is hard to imagine explicit instruction about word meanings that does 
not either begin or end with some sort of statement or description of the meaning of the word being 
learned. 

Vocabulary instruction generally involves definitions that reflect many of the characteristics of the 
standard model of word meaning. The very term definition implies criteriality, abstractness, and 
parsimony of statement. Very often, the definitions used in vocabulary instruction and in glossaries 
are even shorter, more abstract, and less likely to include examples and information about contextual 
constraints than those found in regular dictionaries. Especially in such cases, the theoretical weaknesses 
of the standard model translate into pedagogical shortcomings. 

Definitions and Meanings 

A fundamental fact to be accounted for by any theory of word meaning is that people are, for the most 
part, not able to articulate their knowledge of word meanings quickly or easily (Johnson-Laird, 1987). 
This fact implies that a distinction has to be made between knowledge of word meanings and knowledge 
of definitions. 

The fact that word knowledge normally does not take the form of definitions is a matter of logical 
necessity in the last analysis. Definitions map words onto other words, and at some point this circularity 
must be broken (Fodor et al., 1980). Familiar words that have been learned from experience in context 
are especially difficult to define. When familiar, everyday words are cast in definitions, it must be in 
terms of much less familiar words (Johnson-Laird, 1987). 

But even for those words that are more amenable to definition, the distinction between knowing the 
meaning of the word and knowing its definition must be maintained. For most words one knows, 
producing a definition requires a substantial amount of reflection, whereas understanding what the 
word means during reading takes about a quarter of a second (McConkie, Reddix, & Zola, 1985). 

Knowing a definition is neither a necessary nor a sufficient condition for knowing the meaning of a word. 
Children successfully understand and use many words before they have the linguistic or cognitive 
sophistication to either produce or understand any sort of formal definition (Watson, 1985). Even for 
adults, the task of producing a definition for a known word is not trivial. 

Nor can the gap between knowing meanings and knowing definitions be attributed simply to problems 
in expressing oneself. When selecting the right alternative on a multiple choice vocabulary test or 
looking up a familiar word in the dictionary, one often has the sense that some insight was gained. 
Thus, one can know a word well enough, and still feel that one has learned something by seeing an 
explicit definition. This would not happen if the definition already constituted understanding of the 
word. 
Not only is knowledge of a definition not a necessary condition for knowing the meaning of a word, it 
is not a sufficient one either. Miller and Gildea's (1987) examples of the sort of sentences children 
produce on the basis of definitions make the point forcefully. For instance, given the definition of 
correlate as "to be related one to the other," one child wrote, "Me and my parents correlate, because 
without them I wouldn't be here." That definitions are not sufficient for knowing the meanings of 
words is also forcefully illustrated by the widespread failure of definition-based instruction in word 
meanings to improve comprehension of text containing the instructed words (Graves, 1986; McKeown, 
Beck, Omanson & Pople, 1985; Mezynski, 1983; Stahl & Fairbanks, 1986). 

Attributes of the Standard Model Reflected in Definitions 

The chief pedagogical difficulties with definitions arise from the properties of parsimony and 
abstractness. These properties are a consequence both of the standard model of meaning and of the 
nature of dictionaries. Dictionaries are designed as reference works, not as teaching aids, and the 
practical consideration of length limits their informativeness. 

The need to be brief pushes writers and editors of definitions to use very sophisticated and abstract 
language. Definitions are commonly shortened by using abstract nouns, which allows stating a predicate 
without specifying the arguments. For example, in a basal glossary, habits is defined as "usual behavior" 
rather than "what a person or animal usually does." Although the more abstract wording saves space, 
it diminishes the pedagogical usefulness of definitions. The linguistic devices used in definitions are not 
likely to be familiar to many younger and less-able learners, the very ones most likely to need help with 
word meanings. 

Another problem is that a brief definition often simply does not contain sufficient information about 
a word. As we have argued earlier, any attempt to reduce the variety of contextual meanings of a word 
by stipulating a core meaning results in substantial loss of semantic content and precision. Unless 
illustrative sentences are included, definitions do not provide sufficient information about how a word 
is actually used. And comprehension is no respecter of the line between linguistic knowledge and 
general knowledge, wherever one draws that line. Understanding a text always depends on knowledge 
that goes beyond any narrow conception of word meanings. 

Definitions and Instruction 

Definitions, even short ones, can make a contribution to the process of word learning. Definitions can 
convey information that may not be available from the natural contexts in which a word appears. Thus, 
learning the definition for a word may serve as a foundation for making more effective use of 
subsequent encounters with that word in context, or help in organizing and synthesizing information 
gained from prior encounters. 

However, real word knowledge is less abstract than the information provided in a definition, and 
includes representations of the contexts in which a word appears. This type of word knowledge can 
be inferred from a definition, but usually only if a sufficient number of illustrative contexts are also 
available to the learner. Studies such as those of McKeown et al. (1985) show that intensive vocabulary 
instruction, but not definitional instruction alone, improves text comprehension. These studies 
demonstrate that vocabulary instruction only produces gains in real uses of language when the students 
have engaged in activities that foster the translation of definitions into a working knowledge of meanings. 

The process of arriving at definitions of words through, for instance, class discussion, though it can be 
time-consuming, is probably productive in the long run. One benefit is the insight students gain into the 
process of making explicit knowledge that has formerly been tacit. But, to reiterate, the ability to 
recognize or even state definitions is a symptom of word knowledge, not its essence. 

In summary, although explanations of the meanings of words are certainly a part of vocabulary 
instruction, the conventions that constrain traditional definitions make them of limited pedagogical 
value. The information in definitions must be supplemented by experience with the words to be learned 
in natural contexts. Teaching word meanings via definitions alone is analogous to trying to teach 
someone to cook by having them heat TV dinners. 

Beyond Definitions: Treating Word Meanings as Complex, Ill-structured Knowledge 

Much of this report has been devoted, in effect, to the point that to truly know the meaning of a word 
is to possess complex and ill-structured knowledge. According to Spiro and his colleagues (Spiro, 
Feltovich, & Coulson, in press; Spiro, Vispoel, Schmitz, Samarapungavan, & Boerger, 1987), one aspect 
of ill-structuredness is the contextual interaction of concepts. For example, the flow of blood in the 
circulatory system depends not just on the properties of the heart, and of the veins, and of the arteries, 
and of the capillaries; it depends on how these components fit together as a system. Likewise, the 
meaning of a sentence is not a simple compositional function of the core meanings of individual words. 

A second aspect of ill-structuredness is irregularity. In an ill-structured domain, knowledge of the 
domain cannot be reduced to a single generalization or organizational scheme. The family-resemblance 
structure of related word meanings is exactly this kind of case. 

Spiro, Coulson, Feltovich and Anderson (1988) argue that advanced knowledge acquisition in a complex 
and ill-structured domain requires viewing the domain from multiple perspectives, a case-based rather 
than an abstraction-based approach, and a commitment to avoiding oversimplification. Spiro and his 
colleagues have been concerned with, for example, the learning of biomedical concepts by medical 
students. However, we want to argue that their formulation is equally applicable to the learning and 
teaching of word meanings. 

In vocabulary instruction, multiple examples have proved desirable. The research literature shows 
that more effective learning is associated with multiple exposures to a word (Stahl & Fairbanks, 1986). 
Furthermore, in improving comprehension, multiple exposures involving varied contexts, rather than just 
multiple exposures to definitions, seem necessary (McKeown, Beck, Omanson & Pople, 1985). 

Likewise, in vocabulary instruction there is much to be said for a case-based rather than 
abstraction-based approach. As we have already suggested, the widespread existence of family resemblance clusters 
of word meanings indicates that people extend word meanings to new cases on the basis of specific 
known cases, rather than on the basis of some more abstract and general representation. Pedagogically, 
this principle suggests that instead of definitions being seen as central, and illustrative examples of words 
in context as supplementary, the opposite ought to be the rule. 

In vocabulary learning, as in medicine and the other fields Spiro and his associates have examined, 
oversimplification is an ever present danger, and it is a danger that often arises because of over-reliance 
on abstract generalizations. To borrow another example from Miller and Gildea (1987), given the 
definition of meticulous as "very careful or too particular about small details," a student apparently took 
the meaning to be "very careful," and wrote the sentence "I was meticulous about falling off the cliff." 

The whole point of vocabulary learning is flexible application: "reasoning with, and applying, the material 
learned," in Spiro's phrase. Word meanings are learned to serve as tools for comprehension and new 
learning, not simply as facts to be remembered. The truth is that students are seldom called on to apply 
knowledge, say, of the Civil War, other than in answering test questions. Really knowing a word, on 
the other hand, always means being able to apply it flexibly but accurately in a range of new contexts 
and situations. Thus, it can be argued that there is no knowledge addressed in school in which 
application is more crucial than knowledge of word meanings. The challenge for educators is to provide 
instruction of the sort that will lead to flexible application of word knowledge. 

Conclusion 

What we have mainly done in this report is take a close look at the standard theory of word meanings. 
We have concluded that every element of the theory is open to serious challenge. No one has yet been 
able to specify a set of semantic features that are atomic, sense-based, and universal in application, that 
provide an exhaustive and satisfying analysis of word meanings, or that give a firm foundation for 
sentence semantics; nor is there good reason to hope that these goals are even achievable. In its very 
attempt to provide a parsimonious, general account of semantics, standard theory falls woefully short 
of a linguistically adequate and psychologically realistic characterization of meaning. 

There is general dissatisfaction with standard theory among semantic theorists. Yet this report is, we 
submit, more than an exercise in beating dead horses. Versions of the standard theory do have 
current-day proponents among scholars concerned with language, notably among information processing 
psychologists trying to account for the speed and fluency with which people process language. More 
important, considering the primary audience of this report, something resembling standard theory seems 
to characterize the conventional wisdom about word meanings in the field of reading education. 

Standard semantic theory seems to provide the tacit foundation for several common practices in 
vocabulary instruction. To the extent that the foundation is weak, the practices may be of dubious 
value. We hope that exposing the assumptions of prevailing semantic theory to close scrutiny will make 
some contribution to raising the level of consciousness of the field, informing debate, refocusing 
research, and informing practice. 

One instructional practice that is widespread is to preteach unfamiliar vocabulary before commencing 
a reading selection. The assumptions seem to be that (a) words have fixed, context-free meanings and 
(b) the meaning of a text is a compositional function of the meanings of its constituent words. As 
we have argued at length, both of these assumptions are shaky. We believe that apprehending the gist 
of a story is as likely or even more likely to assist in learning the meanings of unfamiliar words than vice 
versa. 

A second widespread practice is reliance on the definitions in dictionaries and glossaries as sources for 
the meanings of unfamiliar words. But if we are correct, and word meanings are context-sensitive, a 
dictionary is a questionable aid for an inexpert language learner. In fact, independent empirical evidence 
shows that dictionaries often fail to provide children with much help (Miller & Gildea, 1987). 

Finally, there is the practice of attempting to promote vocabulary growth by providing direct instruction, 
drill, and practice on lists of isolated, unrelated words. In another line of research, we have established 
that the size of the vocabulary learning task is simply too large to make much headway teaching words 
1 at a time, 10 at a time, or even 100 at a time (Nagy & Anderson, 1984; Nagy, Anderson, & Herman, 
1987; Nagy & Herman, 1987; Nagy, Herman, & Anderson, 1985). The theoretical analysis presented 
in this report provides independent, converging reasons for being suspicious about the value of 
instruction involving lists of isolated words: Word meanings are difficult to capture in the abstract; 
nuance of meaning, especially, depends upon setting and context. 

The most obvious implication of our analysis is as follows: For enhancement of children's vocabulary 
growth and development, there can be no substitute for voluminous experience with rich, natural 
language. 